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Abstract SARS coronavirus is an RNA virus whose rep¬ 
lication is error-prone, which provides possibility for escape 
of host defenses, and even leads to evolution of new viral 
strains during the passage or the transmission. Lots of varia¬ 
tions have been detected among different SARS-CoV strains. 
And a study on these variations is helpful for development of 
efficient vaccine. Moreover, the test of nucleic acid 
characterization and genetic stability of SARS-CoV is im¬ 
portant in the research of inactivated vaccine. The whole 
genome sequences of two SARS coronavirus strains after 
passage in Vero cell culture were determined and were com¬ 
pared with those of early passages, respectively. Results 
showed that both SARS coronavirus strains have high ge¬ 
netic stability, although nearly 10 generations were passed. 
Four nucleotide variations were observed between the sec¬ 
ond passage and the 11th passage of Sinol strain for identi¬ 
fication of SARS inactivated vaccine. Moreover, only one 
nucleotide was different between the third passage and the 
10th passage of Sino3 strain for SARS inactivated vaccine. 
Therefore, this study suggested it was possible to develop 
inactivated vaccine against SARS-CoV in the future. 
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Severe acute respiratory syndrome (SARS), which 
first occurred in November, 2002, in Guangdong Province 
of China, is a life-threatening respiratory disease. By 
July 2, 2003, in the world, 8442 people were infected and 
814 people died, with a mortality rate as high as 9.6%, 
according to WHO (http://cmbi.bjmu.edu.cn/cmbidata/ 
SARS/WHO/0702.mlit). The SARS causative pathogen is 
a member of the family coronaviridae, which has a sin¬ 
gle-stranded positive-sense RNA genome. The sequenced 
full-length genome of SARS-CoV is about 30 kilo-bases 
in length and lias more than 10 open reading frames. And 
the genome organization is similar to that of other coro¬ 
navirus. However, phylogenetic analyses and sequence 


comparisons showed that SARS-CoV is not closely 
related to any of the previously characterized coronavi- 

rus [1 ~ 3] . 

Although having a number of unique features, 
SARS-CoV is most closely related to group 2 coronavirus, 
and was an early split-off from group 2 branch [4] . Our 
previous study also indicated that SARS-CoV did not 
resemble to any other known coronavirus, including IBV 
Beijing isolate which was once epidemic in China [5] . 
However, the early infection of SARS-CoV may have 
originated from wild animals. SCoV-like viruses were 
isolated from Himalayan palm civets and genomes were 
identical to those of human isolates, but a 29-nt sequence 
found in the animal isolates was not detected in most hu¬ 
man isolates. Other studies also showed that five strains 
from human contained this 29-nt segment, and 82-nt dele¬ 
tion or 415-nt deletion in the orf8 region were observed in 
some human isolates and an animal isolate 16,7] . 

SARS-CoV like HIV. is an RNA vims which is 
prone to high mutation rate for its error replication to es¬ 
cape host defenses. 127 base substitutions were identified 
among 14 SARS isolates, and 94 of them changed their 
amino acid sequences 18,9] . In addition. 299 single nucleo¬ 
tide variations were detected among 63 sequences. The 
neutral mutation rate of SARS-CoV is similar to that of 
known RNA vims and is about one-third of that for the 
human immunodeficiency virus [1 . Molecular epidemiol¬ 
ogical approach and sequencing analysis of SARS-CoV 
viruses isolated from 10 Taiwanese patients suggested a 
recent introduction of the vimses into human populations. 
And the estimated genome mutation rate was about 0.1 
per genome, demonstrating possibly one of the lowest 
rates among known RNA viruses 1101 . A full-length cDNA 
of the SARS-CoV Urbani strain was assembled and 
molecularly cloned SARS vimses were rescued by using 
five contiguous cDNAs fragments that span the entire 
genome. Recombinant vimses replicated as efficiently as 
wild type vims. And the full-length cDNA of the 
SARS-CoV allowed for the development and testing of 
candidate vaccines, especially for the development of live 
attenuated vaccine 1111 . In this work, the whole genome 
sequences of two SARS-CoV vimses after passage in 
Vero cell culture were determined to develop inactivated 
vaccine. 

1 Materials and methods 

( i) Vims. Two SARS-CoV strains were obtained 
from the Chinese Academy of Medical Sciences and Pe¬ 
king Union Medical College and renamed Sinol strain 
and Sino3 strain by Sino Biotech Co., Ltd. respectively. 
Both viral strains were originally isolated by Peking Un¬ 
ion Hospital from nasal sw abs of two SARS patients and 
were confirmed as SARS-CoV by PCR and electron mi¬ 
croscopy. And the complete genomes of them were se¬ 
quenced and submitted to the GenBank. named PUMC03 
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strain and PUMC02 strain respectively. In this study. 
PUMC03 strain (GenBank: AY357076) is the second 
passage of Sinol strain (Sinol-2 isolate), while PUMC02 
strain (GenBank: AY357075) is the third passage of Sino3 
strain (Sino3-3 isolate). 

Both SARS-CoV strains were propagated on Vero 
cells in a flask at 37°C until 75% cells showed cytopadiic 
effect. After three freeze-thaw cycles, viruses were har¬ 
vested and well characterized. The batch of vims was 
stored at -80 °C as seeds or reference viruses for neutrali¬ 
zation assay. Vero cells were obtained from American 
type culture collection (ATCC) and were grown at 37°C in 
minimum essential medium (MEM), containing 10% (n7v) 
fetal calf senun and 1% glutamine. Finally, the lldi pas¬ 
sage of Sinol strain (Sinol-11 isolate) and the 10th pas¬ 
sage of Sino3 strain (Sino3-ll isolate) were obtained after 
subsequent passage in Vero cell culture. Vims was col¬ 
lected. and Trizol LS reagent (Invitrogen, USA) was 
added for viral RNA extraction. All work was finished at 
a biosafety level 3. 

(ii) Viral RNA extraction and RT-PCR. Samples 
were removed from BSL-3 and viral genomic RNA was 
extracted under BSL-2 conditions. The random hexameric 
primer (Promega, USA) was used in cDNA first-strand 
synthesis. After cDNA first-strand was synthesized. 74 
specific primer pairs were used in the reaction of PCR. 
Rapid amplification of cDNA ends (RACE) was per¬ 
formed to capture the 3'-end and 5'-end of the viral ge¬ 
nome. Other three primers were designed for RACE (one 
for 3' RACE and the other two for 5' RACE). 

(iii) Sequencing and analysis. RACE products and 
some PCR products were cut from 1% agarose gels and 
purified using the GeneClean III Kit (Qbiogene). Others 
were purified by MultiScreen-PCR Plates (Millipore). 
Each PCR fragment was directly sequenced from inward 
and outward directions. And automated sequencing with 
BigDye Terminator kit (Perkin Elmer) was run on an ABI 
377 DNA sequencer. We used the Plired/ Plirap/ Consed 
package to process all the raw sequence reads for base 
calling, assembly, and editing. Sequence regions with 
poor quality were resequenced from purified PCR frag¬ 
ments. Aligmnents of genome sequences of SARS-CoV 
were perfonned with Clustal W or on the line (http:// 


www.ncbi.nlm.nih.gov/blast/bl2seq). Amino acid se¬ 
quences of seven proteins also aligned with Clustal 

X1.81. 

2 Results and discussion 

In total, for each isolate, 76 fragments were obtained 
from RT-PCR and RACE, winch span the entire genome. 
All component sequences were determined in duplicate 
and the average length of them was 420 bases. Therefore, 
for any region of the genome there were four-fold to 
six-fold coverage, and there were more than 20 bp for 
every overlap. Finally, the complete genome sequence 
was assembled and the genome of the 10th passage of 
Sino3 strain (Sino3-ll isolate. GenBank: AY485278) was 
29740 nucleotides in length, while that of the 11th pas¬ 
sage of Sinol strain (Sinol-11 isolate, GenBank: 
AY485277) was 29741 bp. Compared with Tor2 isolate 
(GenBank: NC 0004718). nucleotide deletions were kept 
in both viral strains during passage in cell culture (2-nt 
deletion in Sino3 strain. 3-nt deletion in Sinol strain, re¬ 
spectively). 

( i ) Comparison of genome sequences. After 
aligmnents of the full-length genomes of these SARS- 
CoV isolates were finished, both Sinol-2 isolate and 
Sino3-3 isolate were found to have 14-nt deletion at the 
end of the 5'-end of genome, when compared to those of 
other three SARS-CoV (Sino3-ll isolate. Sinol-11 isolate 
and Tor2 isolate). Except these deletions, no other dele¬ 
tion or insert was detected between early passage and late 
passage of two vims strains. We think that the 5'-end of 
Sinol-2 and Sino3-3 isolate genomes could not be cap¬ 
tured completely. In addition, four single nucleotide 
variations were detected between the second passage and 
the 11th passage of the Sinol strain. For Sino3 strain, the 
genome of passage3 differs that of passagelO only by one 
nucleotide. The reversion at nucleotide position 6574 
from G to A in Sinol-11 isolate might be the result of a 
back-mutational event occurring potentially during the 
passage of the vims. Other studies also showed that there 
existed few genetic variations between primary samples 
and viruses isolated after limited passages of cultures 
from two SARS patients. The two sequences from patient 
1 were completely identical, but few single nucleotide 
variations were detected between the two sequences from 


Table 1 Nucleotide sequence variations of the complete genome of five SARS-CoV isolates 
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G 

T 

T 
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G 
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G 

T 

T 

A 
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Nucleotide positions and organization of open reading frames based on Tor2 sequence. Pane represent nucleotide mutation occurred during the 
passage of viruses in cell culture. The bold letters stand for nucleotide difference from the Tor2 isolate. Dashes indicate a nucleotide deletion. 
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patient 2 [10] , showing a high genetic stability during the 
course of passage in Vero cell culture. 

On the other hand, more or less variations were ob¬ 
served in the transmission of SARS. Genomic sequencing 
for viral isolates from five Amoy Gardens patients were 
finished and the vims sequence was identical in four of 
these five patients, but two single nucleotide variations 
existed in sequence from patient 2 [121 . However, studies of 
five SARS isolates from an index patient, three primary 
contacts of the index patient and one secondary' contact 
related to the index patient, suggested that more mutations 
occurred during the transmission to escape from host 
immune response. Two sequences from primary contacts 
were found to have 5-nt deletion and 6-nt deletion in a 
non-coding region, respectively. Besides the deletion, 
there were nucleotides substitutions among die five se- 
quences [8] . Vims passage in cell culture has fewer muta¬ 
tions than the transmission. Because of a stable condition 
in cell culture and non-selective pressure from host, the 
mutations of each point in the viral genome are low and 
similar, which result in the most variations for orflab , for 
s, then for n and so on. In our study, four of five single 
nucleotide variations were found in the region encoding 
Pplab which contains 2/3 the length of the whole genome. 
And the remainder 20% was in tire region encoding S 
glycoprotein. 

(a) s 


(ii) Comparison of amino acid sequences for major 
proteins. Aligmnents of the amino acid sequences for 
seven major proteins (Pplab, S, 3a. 3b, E. M, N) were 
performed and the variations were found (5 for Pplab, 2 
for S, 3 for E. lfor 3b, M, and none for 3b, N, respec¬ 
tively). The variations of S glycoprotein. E protein and M 
protein are shown in Fig. 1. 

Among five amino acid variations in the polyprotein, 
three of them located in Orfla, and the others lay in Orflb. 
As far as S glycoprotein is concerned, the amino acid se¬ 
quence of SI subunit is more variable than that of S2 
subunit and more variations were detected in SI. Align¬ 
ments of 14 different sequences of S indicated that most 
of the amino acid mutations in S glycoprotein are located 
in the SI subunit (1—680) [13] . In this study, we also found 
that both amino acid variations in S glycoprotein lay in SI 
domain. In addition, the 3-nt deletion in the Sinol strain 
genome caused an amino acid deletion in E protein. And 
tiie 2-nt deletion in the Sino3 strain genome was in the 
region encoding 8b protein and caused a stop of transla¬ 
tion. Deletions occurred in the orf8 region with no appar¬ 
ent effect on the survival of the vims, suggesting that this 
region is either noncoding or coding for a functionally 
unimportant putative protein 11 . On the other hand, five 
base substitutions during the passage of vims in cell cul¬ 
ture also changed the amino acid sequence, also showing 
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Sino3-11 QTGVI 
Sino3-3 QTGVI. 
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Tor2 MTSFVSBBTGTLIVNSVLLFLAFWPLLVTLAILTALRLCAYCCNIVNVSLVE PTVYVYSRVKNLNSS«GVPgLLV 76 
Sinol-11 MYSFVSBBTGTLI-NSVLLFLAPWPLLVTLAILTALRLCAYCCYIVNVSLVKPTVYVYSRVKNLNSSBGVPBLLV 75 
Sinol-2 MYSFVSBB'rGTLI-NSVLLFLAFWFLLVTLAILTALRLCAYCCYIVNVSLVKPTVYVYSRVKHLNSSBGVpjjLLV 75 
Sino3-ll MYSPPSBETGTLIVNSVLLPLAFWPLLVTLAILTALRLCAYCCNIVNVSLVKPTVYVYSRVIOILNSSEGVPDLLV 76 
Sino3- 3 MYSFFSBBTGTLIVNSVLLFLAFWFLLVTLAILTALRLCAYCCNIVNVSLVKPTVYVYSRVKNLNSSBGVPDLLV 7 6 
ruler 1.10........20.30..40.50.60..70. 


(C) M 


Tor2 MADNGTITVBBLKQLLBQWHLVIGFLFLAWIMLLQFAYSNRNRFLYIIKLVPLWLLWPVTLACFVLAAVYRINWVTGGIAIAMACIVGLMWLSYPVASPR 100 
Sinol-11 MADNGTITVBBLKQLLBQWHLVIGPLCLAWIMLLQPAYSHRNRPLYIIKLVPLWLLWPVTLACPVLAAVYRINWVTGGIAIAMACIVGLMWLSYPVASPij 100 
Sinol-2 MAONGTITVBBLKQLLBQWNLVIGFLCLAWIMLLQFAYSHRNRFLYIIKLVPLWLLWPVTLACPVLAAVYRINWVTGGIAIAMACIVGLHWLSYPVASPR 100 
Sino3-11 MAONGTITVBBLKQLLBQWNLVIGPLCLAWIMLLQPAYSNRNRFLYIIKLVPLWLLWPVTLACPVLAAVYRINWVTGGIAIAMACIVGLMWLSYFVASPR 100 
Sino3-3 MApWGTITVBELKQLLBQWIILVIQPLCLAWIMLLQPAYSNRNRPLYlIKLVPLWLLWPVTLACPVLAAVYRIWWVTGGIAIAMAClVGLMWLSYPVASPlj 100 



Fig. 1. Distribution of amino acid variations of S glycoprotein (a), E protein (b) and M protein (c). Only partial sequences of S protein and M protein 
with variations were shown here. Asterisks above the sequence indicate that all amino acids were identical. Ruler under the sequence was used to de¬ 
termine the position of amino acid. 
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that the rates of nonsynonymous changes were greater 
than those of synonymous changes for the s gene se¬ 
quences and orfla sequences 17 ’ 10] . 

Although amino acid sequences of two SARS-CoV 
isolates were changed after passage, no signs showed re¬ 
markable change in its immunogenicity and antigenicity. 
And the viral titer was at the same level before passage. 
In all, SARS-CoV passage in cell culture has high stabil¬ 
ity. 

3 Conclusion 

A number of variations were found among separate 
SARS-CoV viruses. A study of these mutations will help 
develop effective and safe vaccines 1 8] . Moreover, the 
low mutation rate of SARS-CoV suggested diat develop¬ 
ment of a vaccine against the SARS-CoV in future would 
be less difficult than expected before [7,10] . 

In addition, in this study, the Sino3 strain was used 
to develop inactivated vaccine, and die Sinol strain was 
used for testing the candidate vaccine. It was very impor¬ 
tant to know die genetic stability of diem. The complete 
genomes of these two SARS-CoV isolates, which repas¬ 
saged in Veto cell culture, were sequenced and analyzed. 
And the result indicated a high genetic stability for 
SARS-CoV passage in Vero cell culture, which implies 
that there is high possibility to develop inactivated vac¬ 
cine against the SARS-CoV. 
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